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Abstract 

We consider the problem of detecting a cycle in a directed graph that grows by arc insertions, 
and the related problems of maintaining a topological order and the strong components of such 
a graph. For these problems we give two algorithms, one suited to sparse graphs, the other to 
dense graphs. The former takes (9(min{;7i'/^,n^/^}m) time to insert m arcs into an «-vertex 
graph; the latter takes 0{n^\ogn) time. Our sparse algorithm is considerably simpler than a 
previous 0(m^/^)-time algorithm; it is also faster on graphs of sufficient density. The time 
bound of our dense algorithm beats the previously best time bound of 0{ir'/^) for dense graphs. 
Our algorithms rely for their efficiency on topologically ordered vertex numberings; bounds on 
the size of the numbers give bounds on running time. 

1 Introduction 

Perhaps the most basic algorithmic problem on directed graphs is cycle detection. We consider 
an incremental version of this problem: given an initially empty graph that grows by on-line arc 
insertions, report the first insertion that creates a cycle. We also consider two related problems, 
that of maintaining a topological order of an acyclic graph as aixs are inserted, and maintaining the 
strong components of such a graph. 

We use the following terminology. We order pairs lexicographically: a, b < c,d if and only if 
either a < b, or a = b and c < d. We denote a list by square brackets around its elements; "[ ]" 
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denotes the empty list. We denote list catenation by "&". In a directed graph, we denote an arc 
from V to w by (v,w). We disallow multiple arcs and loops (arcs of the form (v,v)). We assume 
that the set of vertices is fixed and known in advance, although our results extend to handle on-line 
vertex insertions. We denote by n and m the number of vertices and arcs, respectively. We assume 
that m is known in advance; our results extend to handle the alternative. To simplify expressions for 
bounds we assume n> \ and m = Q.{n); both are true if there are no isolated vertices. A vertex v is 
a predecessor of w if (v, w) is an arc. The size size{w) of a vertex w is the number of vertices v such 
that there is a path from v to w. Two vertices, two arcs, or a vertex and an arc are related if they 
are on a common path, mutually related if they are on a common cycle (not necessarily simple), 
and unrelated if they are not on a common path. Relatedness is a symmetric relation. The strong 
components of a directed graph are the subgraphs induced by the maximal subsets of mutually 
related vertices. 

A dag is a directed acyclic graph. A weak topological order < of a dag is a partial order of the 
vertices such that if (v, w) is an arc, v < w; a topological order of a dag is a total order of the vertices 
that is a weak topological order. A weak topological numbering of a dag is a numbering of the 
vertices such that increasing numeric order is a weak topological order; a topological numbering 
of a dag is a numbering of the vertices from 1 through n such that increasing numeric order is a 
topological order. 

There has been much recent work on incremental cycle detection, topological ordering, and 
strong component maintenance llTT43ll7l[8l [T0Ul^[T5l[T6l[T9ll2TI . For a thorough discussion of this 
work see ||7J|3 ; here we discuss the heretofore best results and others related to our work. A classic 
result of graph theory is that a directed graph is acycUc if and only if it has a topological order |[23l ; 
a more recent generalization is that the strong components of a directed graph can be ordered topo- 
logically (so that every arc lies within a component or leads from a smaller component to a larger 
one) 191. For static graphs, there are two C?(m)-time algorithms to find a cycle or a topological order: 
repeated deletion of vertices with no predecessors II131I14II and depth-first search |[24l : the reverse 
postorder f25l defined by such a search is a topological order if the graph is acyclic. Depth-first 
search extends to find the strong components and a topological order of them in 0{m) time ll24l 

For incremental cycle detection, topological ordering, and strong component maintenance, there 
are two known fastest algorithms, one suited to sparse graphs, the other suited to dense graphs. Both 
are due to Haeupler et al. ITHH. Henceforth we denote the coauthors of these papers by HKMST. 
The HKMST sparse algorithm takes 0{irr'l^) time for m arc additions; the HKMST dense algorithm 
takes 0{n^l'^) time. Both of these algorithms use two-way search; each is a faster version of an 
older algorithm. These algorithms, and the older ones on which they ai^e based, bound the total 
running time by counting the number of arc pairs or vertex pairs that become related as a result 
of aix insertions. The HKMST sparse algorithm uses a somewhat complicated dynamic list data 
structure ||4l|6l to represent a topological order, and it uses either linear-time selection or random 
sampling to guide the searches. There are examples on which the algorithm takes Q.{nm^l^) time, 
so its time bound is tight for sparse graphs. The time bound of the HKMST dense algorithm is not 
known to be tight, but there are examples on which it takes n(?2^2'V^^'^") Q. 

Our approach to incremental cycle detection and the related problems is different. We maintain 
a weak topological numbering and use it to facilitate cycle detection. Our algorithms pay for cycle- 
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detecting searches by increasing the numbers of appropriate vertices; a bound on the numbers gives 
a bound on the running time. One insight is that the size function is a weak topological numbering. 
Unfortunately, maintaining this function as arcs are inserted seems to be expensive. But we are able 
to maintain in 0{n^\ogn) time a weak topological numbering that is a lower bound on size. This 
gives an incremental cycle detection algorithm with the same running time, substantially improving 
the time bound of the HKMST dense algorithm. Our algorithm uses one-way rather than two-way 
search. For sparse graphs, we use a two-part numbering scheme. The first part is a scaled lower 
bound on size, and the second part breaks ties. This idea yields an algorithm with a running time of 
0(min{m^/^,n^/^}m). Our algorithm is substantially simpler than the HKMST sparse algorithm and 
asymptotically faster on sufficiently dense graphs. The 0{n^\ogn) algorithm appeared previously 
in in, but the other algorithm is new to this paper. 

The remainder of our paper consists of four sections. Section|2]describes the two versions of our 
cycle-detection algorithm for sparse graphs. Section |3] describes our cycle-detection algorithm for 
dense graphs. Section|4]describes several simple extensions of the algorithms. Section|5]extends the 
algorithms to maintain the strong components of the graph as arcs are inserted instead of stopping 
as soon as a cycle exists. The extensions in Sections |4] and |5] preserve the asymptotic time bounds 
of the algorithms. Section |6] contains concluding remarks. 

2 A Two-Way-Search Algorithm for Sparse Graphs 

Our algorithm for sparse graphs uses two-way search. Unlike the entirely symmetric forward and 
backward searches in the HKMST sparse algorithm, the two searches in our algorithm have different 
functions. Also unlike the HKMST sparse algorithm, our algorithm avoids the use of a dynamic list 
data structure, and it does not use selection or random sampling: all of its data structures are simple, 
as is the algorithm itself. 

We use a two-part numbering scheme whose lexicographic order is topological. Specifically, we 
partition the vertices into levels. We maintain a weak topological numbering of the levels; within 
each level, we give the vertices indices ordered topologically. Each backward search proceeds 
entirely within a level. If the search takes too long, we stop it and increase the level of a vertex. 
This bounds the backwai^d search time. Each forwai^d search traverses only aixs that lead to a lower 
level, and it increases the level of each vertex visited. An overall bound on such increases gives a 
bound on the time of all the forward searches. If the backward and forward searches do not detect a 
cycle, we update vertex indices to restore topological order. To facilitate this, we make the searches 
depth-first. 

Each vertex v has a level k{v) and an index i{v). Indices ai^e distinct. While the graph remains 
acyclic, lexicographic order on level and index is a topological order. That is, if {x,y) is an arc, 
k{x),i{x) < k{y),i(y). Levels are positive integers and indices are negative integers. We make 
indices negative because newly assigned indices must be smaller than old ones. An alternative is to 
maintain the negatives of the indices and reverse the sense of all index comparisons. Initially, each 
vertex v has ^(v) = 1 and /(v) an integer between —n and —1 inclusive, distinct for each vertex. 

In addition to levels and indices, we maintain a variable index equal to the smallest index as- 
signed so far. To represent the graph, we maintain for each vertex v the set out(y) of outgoing arcs 
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(v,w) and the set in{v) of incoming arcs {u,v) such that k{u) = k{v). Initially index = —n and all 
incident arc sets are empty. Each backward search marks the vertices it visits. Initially all vertices 
are unmarked. To bound backward searches, we count aic traversals. Let A = min|m^/^,/i^/^}. 
Recall that we denote a list with square brackets "[ ]" around its elements and list concatenation 
with "&." 

The algorithm for inserting a new arc (v,>v) consists of the following steps: 
Step 1 (test order): If ^(v) , i{v) < k{w) , i{w) go to Step 5 (lexicographic order remains topological). 
Step 2 (search backward): Let S [ ], F [ ], and arcs ^ 0, where ^ denotes the assignment 
operator. Do Bvisit(v), where mutually recursive procedures BvisiT and Btraverse are defined 
as follows: 

Bvisit(j) 

1 mark J 

2 for {x,y) G m{y) 

3 do B TRAVERSE (x, 3;) 

4 B^B&[y] 

B TRAVERSE 

1 ifx = w 

2 then stop the algorithm and report the detection of a cycle. 

3 arcs ^ arcs + 1 

4 if arcs > A 

5 then (The search ends, having traversed at least A arcs without reaching w.) 

6 k{w)^k{v) + \ 
1 in{w) ^ {} 

8 B^[] 

9 unmark all marked vertices 

10 go to Step 3 (aborting the backward search) 

11 if X is unmarked 

12 then Bvisit(x) 

If the search ends without detecting a cycle or traversing at least A arcs, test whether k{w) = k(y). 
If so, go to Step 4; if not, let k{w) = k{v) and in{w) = {}. 

Step 3 (forward search): Do Fvisit(w), where mutually recursive procedures FviSiT and Ftra- 
VERSE are defined as foUows: 

F visit (x) 

1 for {x,y) G out{x) 

2 do Ftraverse(x,j) 

3 F ^ [x]&F 
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Ftraverse(x,3;) 

1 if J = V or J is in S 

2 then stop the algorithm and report the detection of a cycle 

3 if k{y)<k{w) 

4 then k{y) ^ k{w) 

5 in{y) ^ {} 

6 Fvisit(3;) 

7 [> Now, k{y) > k{w) 

8 \fk{y)=k{w) 

9 then add {x,y) to in{y) 

Step 4 (re-index): Let L = B&F. While L is nonempty, let index = index — I, delete the last vertex 
X on L, and let i{x) = index. 

Step 5 (insert arc): Add (v,w) to out{v). If ^(v) = k{w), add (v,w) to /?i(w). 

Theorem \ If a new arc creates a cycle, the insertion algorithm stops and reports a cycle. If not, 
lexicographic order on level and index is a topological order. 

Proof. By inspection, the algorithm correctly maintains the incident arc sets and the value of 
index. We prove the theorem by induction on the number of arc insertions. Initially, lexicographic 
order on level and index is a topological order since there are no arcs: any total order is topo- 
logical. Suppose the lexicographic order is topological just before the insertion of an arc (v,w). 
If ^(v),/(v) < k{w),i{w) before the insertion, then lexicographic order on level and index remains 
topological. Thus assume k{v),i{v) > k(w),i{w). 

Clearly, if the algorithm stops and reports a cycle, there is one. Suppose the insertion of (v, w) 
creates a cycle. Such a cycle consists of the arc (v,w) and a pre-existing path from w to v, along 
which levels are nondecreasing before he insertion. If v and w have the same level, then all vertices 
on the path have the same level, and either the backward search wiU traverse the entire path and 
report a cycle, or it will report a different cycle, or it will stop, w will increase in level, and the 
algorithm will proceed to Step 3. If v has larger level than w, w will increase in level in Step 2, 
and the algorithm will do Step 3. Suppose the algorithm does Step 3. At the beginning of this step, 
vertex w has maximum level on the cycle, and B is the set of vertices from which v is reachable 
by a path all of whose vertices have level k{w). (Either k{w) = k{v), in which case v is in B, or 
k{w) = k{v) + 1, and B = [].) Every vertex on the cycle that is not w and not in B must have level 
less than k{w), and the forward search from w will eventually visit each such vertex, traversing the 
cycle forward, until traversing an arc {x,y) with y = v ot y in B and reporting a cycle. We conclude 
that the algorithm reports a cycle if and only if an arc insertion creates one. 

Suppose the insertion of (v,w) does not create a cycle. After the backward search stops, B 
contains the vertices from which v is reachable by a path all of whose vertices have level k{w). 
Also, if {x,y) is an arc with x not in B, but y in B, k{x) < k{y). Step 3 increases to k{w) the level 
of every vertex in F. After the re-indexing in Step 4, consider any arc {x,y). If neither x nor y is 
in B&F, k{x),i{x) < k{y),i{y), since this was true before the insertion. If both x and y are in B&F, 
k{x),i{x) < k{y),i{y) because B&F is in topological order. If j is in S but x is not in B&F, then 
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k{x) < k{y). If 3^ is in but x is not in B&F, then k{x) < k{y) since y increased in level but x did 
not, and k{x) < k{y) before the arc insertion. If x is in B&F but y is not in B&F, then k{x) < k{y) 
and i{x) < i{y) after the insertion. We conclude that after the insertion, lexicographic order on level 
and index is topological □ 

Lemma 2 The algorithm assigns no index less than —nm — n. 

Proof. All initial indices are at least —n. Each arc insertion decreases the minimum index by at 
most n, so after m insertions the minimum index is at least —nm — n. □ 

Lemma 3 No vertex level exceeds min { m^/^ , n^/^ } + 2. 

Proof. Fix a topological order just before the last arc insertion. Let > 1 be a level assigned 
before the last arc insertion, and let w be the lowest vertex in the fixed topological order assigned 
level k. For w to be assigned level k, the insertion of an arc (v, w) must cause a backward search 
from V that traverses at least A arcs both ends of which are on level k—\. All the ends of these 
arcs must still be on level ^ — 1 just before the last insertion. Thus these sets of arcs are distinct for 
each k, as are their sets of ends. Since there are only m arcs, there are most m/A distinct values of 
k. Also, for each k there must be at least \/A distinct arc ends, since there are no loops or multiple 
arcs. Since there are only n vertices, there are at most «/ \/A distinct values of k. It follows that no 

vertex level exceeds min |ni/A, «/ \/a| + 2, which gives the lemma. □ 

Theorem 4 The insertion algorithm takes C?(min {wi^/^, } m) time for m arc insertions. 

Proof. By Lemmas|2]and[3l all levels and indices are polynomial in n, so assignments and compar- 
isons of levels and indices take 0(1) time. Each backward search takes 0(A) = 0(min {?n^/^,?i^/^|) 
time. The time spent adding and removing arcs from incidence sets is 0(1) per arc added or re- 
moved. An arc can be added or removed only when it is inserted into the graph or when the level of 
one of its ends increases. By Lemma[3l this can happen at most 0(min {m'/^,?i^/^}) times per arc. 
The time for a forward search is 0(1) plus 0(1) per arc {x,y) such that x increases in level as the 
result of the arc insertion that triggers the search. By Lemma [3l this happens 0(min{ni^/^,?i^/^}) 
times per arc. □ 



The space needed by the algorithm is 0(m). 

Theorem 5 For any n and m with m < n{n— l)/2, there exists a sequence of m arc insertions 
causing the algorithm to run in Q.{vam {m'/^ , n^/^ } m) total time. 

Proof. Assume without loss of generality that m > 2n and n is sufficiently large. Let the vertices 
be 1 through n, numbered in the initial topological order. We first add arcs consistent with the 
initial order (so that no reordering takes place) to construct a number of cliques of consecutive 
vertices. An r-clique of vertices k thi^ough /: + r— 1 is formed by adding aix [i,]) for ij such that 
k<i<i<k + r—\. An r-clique consists of r vertices and r(r — l)/2 arcs. 
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Let ri = Y^Jmll\ . Construct an ri -clique of the first r\ vertices. This is the main clique. The 
main clique contains at most n/2 vertices and at most m/4 arcs. Let r2 = VA+l ■ Starting 
with vertex r\ + 1, construct r2-cliques on disjoint sets of consecutive vertices, until running out of 
vertices or until \m/2\ arcs have been added, including those added to make the main clique. Each 
of the r2-cliques is an anchor clique. The number of arcs in each anchor clique is 0(A) and at least 
A. Number the anchor cliques from 1 though k in increasing topological order. Then k = 0(A). So 
fai" there has been no vertex reordering, and all vertices have level 1 . 

Next, for j from k—\ thi^ough 1 in decreasing order, add arcs from the last vertex of anchor 
clique j + 1 to each vertex of anchor clique j. Add these arcs in decreasing topological order with 
respect to the end of the arc that is in anchor clique j. There are at most n/2 < m/4 such arc 
additions. Each addition of an arc from the last vertex of anchor clique ^ to a vertex w in anchor 
clique k—\ triggers a backward search that traverses at least A arcs and causes the level of w to 
increase from 1 to 2. Each forward search visits only a single vertex. Once all arcs from anchor 
clique k are added, all vertices in anchor clique k — \ have level 2. Addition of the arcs from the 
last vertex of anchor clique ^ — 1 to the vertices in anchor clique k — 2 moves all vertices in anchor 
clique ^ — 2 to level 3. After all the arcs between anchor cliques are added, every vertex in anchor 
clique j is on level k + \— j. The number of arcs added to obtain these level increases is at most 
n/2< m/4. 

Finally, for each anchor clique from k — 2 through 1 in decreasing order, add an arc from its first 
vertex in topological order to the first vertex in the main clique. There are at most n/2 < m/4 such 
arc additions. Each addition triggers a backward search that visits only one vertex, followed by a 
forward search that traverses all the arcs in the main clique and increases the level of all vertices in 
the main cUque by one. These forward searches do ©(Am) arc traversals altogether. At most m arcs 
are added during the entire construction. □ 



3 A One- Way-Search Algorithm for Dense Graphs 

The two-way-search algorithm becomes less and less efficient as the graph density increases; for 
sufficiently dense graphs, one-way-search is better. In this section we present a one-way search 
algorithm that takes 0{n^logn) time for all arc insertions. The algorithm maintains for each vertex 
V a level k{v) that is a weak topological numbering satisfying k{v) < size{v). The algorithm pays 
for its searches by increasing vertex levels, using the following lemma to maintain ^(v) < size{v) 
for all V. 

Lemma 6 In an acyclic graph, if a vertex v has j predecessors, each of size at least s, then size{v) > 
s + j. 

Proof. Order the vertices of the graph in topological order and let u be the smallest predecessor 
of V. Then size{v) > size{u) + j > s + j. Here "+/' counts v and the j —I predecessors of v other 
than u. □ 
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The algorithm uses Lemma [6] on a hierarchy of scales. For each vertex v, in addition to a level 
^:(v), it maintains a bound bi{v) and a count c,(v) for each integer /,0 < / < [lg«J, where Ig is the 
base-2 logarithm. Initially ^(v) = 1 for all v, and bj{v) = c,(v) = for all v and /. To represent the 
graph, for each vertex v the algorithm stores the set of outgoing arcs (v, w) in a heap (priority queue) 
out{v), each arc having a priority that is at most k{w). (This priority is either k{w) or a previous 
value of k{w).) Initially all such heaps ai^e empty. 

The arc insertion algorithm maintains a set of arcs A to be traversed, initially empty. To insert 
an arc (v, w), add (v, w) to A and repeat the following step until a cycle is detected or A is empty: 

Traversal Step: 

1 delete some arc {x,y) from A 

2 \iy = v 

3 then stop the algorithm and report a cycle 

4 i^k{x)>k{y) 

5 then k{y) ^ k{x) + 1 

6 else [> k{x) < k{y) 

7 /^Llg(^(3;)_^(^))J 

8 Ci{y)^Ci{y) + \ 

9 if c;(j) = 3-2'+i 

10 thenc,(y)^0 

11 k{y)^max{k{y),bi{y) + ^-2'} 

12 bi{y)^k{y)-2'+\ 

13 delete from out{y) every arc with priority at most k{y) and add these arcs to A. 

14 add {x,y) to out{x) with priority k{y). 

In a traversal step, an arc {y,z) that is deleted from out{y) may have k{z) > k{y), because k{z) 
may have increased since {y,z) was last inserted into out{y). Subsequent traversal of such an arc 
may not increase k(z). It is to pay for such traversals that we need the mechanism of bounds and 
counts. 

We implement each heap out{v) as an array of buckets indexed from 1 thi^ough n, with bucket / 
containing the arcs with priority /. We also maintain the smallest index of a nonempty bucket in the 
heap. This index never decreases, so the total time to increment it over all deletions from the heap 
is 0{n). The time to insert an arc into a heap is 0(1). The time to delete a set of arcs from a bucket 
is 0(1) per arc deleted. The time for heap operations is thus 0(1) per arc traversal plus 0{n) per 
heap. Since there are n heaps, this time totals 0(1) per arc traversal plus 0{n^). 

To analyze the algorithm, we begin by bounding the total number of wc traversals, thereby 
showing that the algorithm terminates. Then we prove its correctness. Finally, we fill in one detail 
of the implementation and bound the running time. 

Lemma 7 While the graph remains acyclic, the insertion algorithm maintains k[v) < size (v) for 
every vertex v. 

Proof. The proof is by induction on the number of arc insertions. The inequality holds initially. 
Suppose it holds just before the insertion of an arc (v,w) that does not create a cycle. Consider a 
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traversal step during the insertion that deletes {x,y) from A and increases k{y). If k{y) increases to 
k{x) + 1, size{y) > 1 +size{x) > 1 +k{x), maintaining the inequality for y. The more interesting 
case is when Cj{y) = 3 • 2'+^ and k(y) increases to bi{y) + 3- 2'. Each of the increases to c,(j) since 
it was last zero corresponds to the traversal of an arc {z,y)- When Cj{y) was last zero, ^((3') = 
max ^0,k{y) — 2'+' }. Since k(y) cannot decrease, bj{y) < k{z) < size{z) when this traversal of {z^y) 
occurs, since at this time k{y) — k{z) < mm{k{y),2'^^Y We consider two cases. If there were 
at least 3 • 2' traversals of distinct arcs (z,y) since c,(y) was last zero, then size{y) > bi{y) + 3-2' 
by Lemma [6l and the increase in k{y) maintains the inequality for y. If not, by the pigeonhole 
principle there were at least three traversals of a single arc {z,y) since c;(y) was last zero. When 
each traversal happens, k{y) — k{z) > 2', but each of the second and third traversals cannot happen 
until k{z) increases to at least the value of k{y) at the previous traversal. This implies that when 
the third traversal happens, k{y) > bi{y) + 3-2', so k{y) will not in fact increase as a result of this 
traversal. □ 

Lemma 8 If a new arc (v,w) creates a cycle, the insertion algorithm maintains k[v) < siz.e{v) +n, 
where sizes are before the addition o/(v,w). 

Proof. Before the addition of (v,w), ^(v) < size{v) for every vertex v, by Lemma|7] Traversal of 
the arc {v,w) can increase ^(v) by at most n, so the desired inequality holds after this traversal. Every 
subsequent traversal is of an arc other than (v,w): to traverse (v, w), an arc into v must be traversed, 
which results in reporting of a cycle. Thus the subsequent traversals are of arcs in the acyclic graph 
before the addition of (v,w). The proof of Lemma |7] extends to prove that these traversals maintain 
the desired inequality: Lemma|6]holds if the size function is replaced by the size plus any constant, 
in particular by the size plus n. □ 

Lemma 9 The total number of arc traversals over m arc additions is 0{n^\ogn). 

Proof By Lemmas |7] and [H every label ^(v), and hence every bound bi{v), remains below 2n. 
Every arc traversal increases a vertex level or increases a count. The number of level increases is 
0{n^). Consider a count c,(v). Each time c;(v) is reset to zero from 3 -2'+^, bi{v) increases by at 
least 2'. Since bi{v) < 2n, the total amount by which c,(v) can decrease as a result of being reset is at 
most 12«. Since c, (v) starts at zero and cannot exceed 4«, the total number of times c, (v) increases 
is at most I6n. Summing over all counts for all vertices gives a bound of O(n^logn) on the number 
of count increases and hence on the number of arc traversals. □ 

Theorem 10 If the insertion of an arc (v, w) creates a cycle, the insertion algorithm stops and 
reports a cycle. If not, the insertion algorithm maintains the invariant that k is a weak topological 
numbering. 

Proof. By Lemma |9] the algorithm terminates. A straightforward induction shows that every arc 
{x,y) traversed by the insertion algorithm is such that x is reachable from v, so if the algorithm 
stops and reports a cycle, there is one. Suppose the insertion of (v,w) creates a cycle. Before the 
insertion of (v,w), ^ is a weak topological numbering, so the path from w to v existing before the 
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addition of (v,w) has vertices in strictly increasing order. Thus v has the largest level on the path. 
A straightforward induction shows that the algorithm will eventually traverse every arc on the path 
and report a cycle, unless it reports another cycle first. 

Suppose addition of an arc (v,w) does not create a cycle. Before the addition, k is a weak 
topological numbering. The algorithm maintains the invariant that every arc {x,y) such that k{x) > 
k(y) is either on A or is the arc being processed. Thus, once A is empty, k is a weak topological 
numbering. □ 

Theorem 11 The algorithm runs in 0{n^\ogn) total time. 

Proof. The running time is 0(1) per arc traversal plus 0{n^). This is 0{n^\ogn) by Lemma|9] □ 

The space needed by the algorithm is 0{nlogn + m) for the labels, bounds, and counts, and 
0{n^) for the n heaps. Storing the heaps in hash tables reduces their total space to 0{m) but makes 
the algorithm randomized. By using a two-level data structure |[27l to store each heap, the space 
for the heaps can be reduced to 0{n^'^ +m) without using randomization. This bound is 0{m) if 
m/n = Q.{n^l^); if not, the sparse algorithm of Section 2 is faster 

The following theorem states that our analysis for this algorithm is tight. 

Theorem 12 For any sufficiently large n, there exists a sequence of&{n^) arc insertions that causes 
our algorithm to do Q.{n^\ogn) arc traversals. 

Proof. Without loss of generality, suppose n = (7/2)r — 3, where r > 2^ is a power of 2. The 
graph we construct consists of three categories of vertices: (1) vertices ui,U2, ■ ■ ■ ,Ur, (2) sets of 
vertices So,S\ , . . . ,5'ig(,.)_2 with \Sj\ = 3 • 2-'+' (so Y^j |Sy| = 3(r/2 — 1)), and (3) a set of vertices T 
with |r| = r. Initially there are no arcs in the graph, and all levels are 1. 

First, add arcs (m,',m,+i) in order for 1 < / < r. After these arc additions, k{ui) = i. These levels 
are invariant over the remainder of the arc insertions — we use these vertices as anchors to increase 
the levels of all the other vertices. In fact, the only time the level of any other vertex v G {[jjSj) U T 
will increase is when adding an arc (m,,v). 

The arc insertions proceed in phases ranging from 2 to r. In phase /, first insert arc (M,_i,f) 
for all f G r, thereby increasing k{t) to k{t) = i. Next, consider each j for which there exists a 
constant c > 3 such that / = c2^, i.e., / is a sufficiently large multiple of 2-'. There are two cases here, 
described in more detail shortly. If c = 3, insert arcs from Sj to T, not causing a level increase tot. If 
c> 3, the algorithm traverses the arcs from Sj to T again, but without causing any level increases to 
f G r. Moreover, the only time any Cj{t) or bj{t) changes, for j > 0, is when the algorithm traverses 
an arc from Sj to t £T. 

Case 1 (add arcs from Sj to T): If / = 3 • 2^ for some j, add arcs (M2j+i_i,5'y) for all sj G Sj, 
causing k{sj) to increase to 2^+^. Also add arcs {sj,t) for all sj G Sj and t £T. Observe that before 
these arc additions [lg{k{t) - k{sj))\ = Llg(2-''+^ -2-'')J = j. Moreover, Cj{t) = and bj{t) = 0. 
For each t, when the last arc insertion occurs, Cj{t) increases to 3 -2^+^ We have, however, that 
k{t) = 3- 2^+^ > bj + 3- 2-i, and hence k{t) does not increase. The counter cj (t) is subsequently reset 
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to and bj{t) = k{t) — 2^+' = k{sj) — iK Finally, the priority of each of these arcs {sj,t) is updated 
to 3 • 2^ in out{sj). 

Case 2 (follow arcs from Sj to T): Otherwise, / = c2^, for c > 3. Since / > 3 -2^, the arcs {sj,t) 
akeady exist. Before this step, we have k{sj) = k{t) — 2^+\ for each Sj G Sj. Moreover, we have 
bj{t) = k{sj) — 2^ = k{t) — 3 -2^ Insert arcs {ui_2i-i^Sj)' for £ ^j- Such an arc insertion causes 
k{sj) to increase to the next multiple of 2A After the update, we have k{sj) equal to the priority of 
each aix {sj,t) in out{sj), and hence the algorithm traverses each of the outgoing arcs. Moreover, 
\g{k{t) — k{sj)) = lg(/ — 2^) = j, and hence the counter cj is affected. For each t, the counter Cj{t) 
again reaches 3 • 2^+' . Since bj{t) = k{t) — 3 • 2\ the level of t again does not increase. The counter 
Cj{t) is subsequently reset to 0, each bj{t) = k{t) — 2^+' = k{sj) — 2\ and the priority of each of the 
arcs {sj,t) is set to k{t) in out{sj). 

In both cases, whenever the phase number / is a large enough multiple of 2^, the algorithm 
traverses all arcs {sj,t) such that Sj G Sj and ? e T. Consider a fixed / There are • |r| = 3 -2^+^ 
such arcs. Summing over all r/2^ — 2 phases during which the phase number is a large enough 
multiple of V, there are (3 • V+^r){r/V - 2) = Q.{r^) = Q.{n^) arc traversals from vertices in Sj to 
vertices in T . Summing over all lg{r) — 2 = 0(log«) values of j yields a total of Q.{n^\ogn) arc 
traversals. □ 

The proof extends to give a slightly more general result: for any 1 < ^ < Ig «, there is a sequence 
of 0(2*^«) arc insertions causing the algorithm to do @{rp-k) arc traversals. To prove this, omit from 
the proof of Theorem [l2]the sets Sj with j > k. The generalization implies that 0(?i) arcs are enough 
to make the algorithm take time, and 0(n^+^) arcs, for any constant £ > 0, are enough to make 
the algorithm take Q.{n^\ogn) time. 

4 Simple Extensions 

In this section we extend our sparse and dense algorithms to provide some additional capabilities 
possessed by previous algorithms. All the extensions are simple and preserve the asymptotic time 
bounds of the unextended algorithms. Our first extension eliminates ties in the vertex numbering 
maintained by the dense algorithm presented in Section [3l We break ties by giving each vertex a 
distinct index as in the sparse algorithm and ordering the vertices lexicographically by level and 
index. The indices can be arbitrary, as long as they are distinct within each level: we can use fixed 
indices, or we can assign new indices when vertices change level. 

Assigning new indices is useful in our second extension, which explicitly maintains a doubly- 
linked list of the vertices in lexicographic order by level and index, and hence in a topological order. 
We maintain a pointer to the first vertex on the list. We also maintain for each non-empty level a 
pointer to the last vertex on the level. We store these pointers in an an^ay indexed by level. When a 
vertex increases in level, we delete it from its current list position and re-insert it after the last vertex 
on its old level, unless it was the last vertex on its old level, in which case its position in the list does 
not change. This takes 0(1) time, including all needed pointer updates. In the sparse algorithm, 
when moving a group of vertices whose levels change as a result of an arc insertion, we move them 
in decreasing order by new index. In the dense algorithm, we can move such a group of vertices in 
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ai^bitrary order, but we then assign each vertex moved a new index that is less than those of vertices 
previously on the level. As in the sparse algorithm, we can do this by maintaining the smallest index 
and counting down. 

Our third extension explicitly returns a cycle when one is discovered, rather than just reporting 
that one exists. We augment each search to grow a spanning tree represented by parent pointers as 
each search proceeds. In the sparse algorithm, the backward search generates an in-tree rooted at 

V containing all visited vertices; the forwai^d search generates an out-tree rooted at w containing all 
vertices whose level increases. If the backward seaixh causes k{w) to increase to ^(v) + 1 and B 
to become empty, the forward search may visit vertices previously visited by the backward search. 
Each such vertex acquires a new parent when the forward search visits it for the first time. When the 
algorithm stops and reports a cycle, a cycle can be obtained explicitly by following parent pointers. 
Specifically, if the backwai^d search traverses an arc {w,y), following pai^ent pointers from y gives a 
path from y to v, which forms a cycle with (v,w) and {w,y). If the forward search traverses an arc 
{x,y) with y = V or y in B, traversing parent pointers from x and from y gives a path from wtox and 
a path from y to v, which form a cycle with {x,y) and (v,w). In the dense algorithm, there is only 
one tree, an out-tree rooted at v, containing v and all vertices whose level increases. Vertex v has 
one child, w. If the search traverses an arc (x, v), following parent pointers from x gives a path from 

V through (vjw) to x, which forms a cycle with {x,v). 

Our fourth extension is to handle vertex insertions and to allow n and m to be unknown in 
advance. In the sparse algorithm, we insert a vertex v by giving v a level of 1 , decrementing index, 
and giving v an index equal to index. We also maintain a running count of n and m. Each time n 
or m doubles, we recompute A, but we replace A only if it doubles. It is straightforward to verify 
that Theorem |4]remains true. In the dense algorithm, we insert a vertex by giving it a level of 1. We 
also maintain a running count of n. Each time [lgn\ increases, we add a corresponding new set of 
bounds and counts. Theorem [TTlremains true. We can combine this extension with any of the other 
extensions in this section, and with the extension described in the next section. 

5 Maintenance of Strong Components 

A less straightforwai^d extension of our algorithms is to the maintenance of strong components. 
This has been done for some of the earlier algorithms by previous authors. Pearce lITTl and Pearce 
and Kelly ifTSi sketched how to extend their incremental topological ordering algorithm and that of 
Marchetti-Spaccamela et al. [161 to maintain strong components; HKMST showed in detail how to 
extend their algorithms. Our strong component extensions follow the approach of HKMST. 

Our extended algorithms store the vertex sets of the strong components in a disjoint set data 
structure |[26l . Such a data structure represents a partition of a set into disjoint subsets. Each subset 
has a canonical element that names the set. Initially all subsets are singletons; the unique element 
in each subset is its canonical element. The data structure supports two operations: 

Find (x) : Return the canonical element of the set containing element x. 

LWK{x,y): Form the union of the sets whose canonical elements are x and y, with the union 
having canonical element x. This operation destroys the old sets containing x and y. 

We store each set as a tree whose nodes are the elements of the set, with each element having a 
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pointer to its parent. If we do Finds using path compression, and we do LiNKing by rank or size, 
then the total time for any number of Find and Link operations on a partition of n elements is 
0{n\ogn) plus 0(1) per operation |[26l . In our application the number of set operations is 0(1) per 
arc examined, so the time for the set operations does not increase the asymptotic time bound. 

5.1 An Extension for Sparse Graphs 

Our extension of the sparse algorithm maintains for each component a level, an index, a set of arcs 
{x,y) such that x is in the component, and a set of arcs {x,y) such that y is in the component and 
the components containing x andj are on the same level. We store the level, index, and incident 
aix sets with the canonical vertex of the component. Initially every vertex is in its own component, 
all components are on level 1, the components have distinct indices between —1 and —n, inclusive, 
all the incident arc lists are empty, and index, the smallest index, is —n. Each search generates a 
spanning tree, represented by parent pointers: if x is a canonical vertex, p{x) is its parent; if x is a 
root, p{x) = X. Initially all parents are NULL. The algorithm marks canonical vertices it finds to be 
in a new component. Initially all vertices are unmarked. 

The algorithm adds a new arc to the appropriate incident arc lists only if its ends ai^e in different 
components after its insertion. The backwai^d searches delete aixs whose ends are in the same 
component, as well as the second and subsequent arcs between the same pair of components. To 
facilitate the latter deletions, it uses a bit matrix M indexed by vertex. Initially all entries of M are 
zero. 

The algorithm for inserting a new arc (v,w) consists of the following steps: 
Step 1 (test order): Let u = Find(v) and z = Find(h'). \fk{u),i{u) < k{z),i{z), go to Step 6. 
Step 2 (search backward): Let B = [],F = [], arcs = 0, p{u) = u, and p{z) = Z- Do EBvisit(m), 
where mutually recursive procedures EBviSiT and EB TRAVERSE are defined as follows: 

EBvisit(?): For {x,y) £ in{t) do EBtraverse(x, j). LetB = S&[f]. 

EB TRAVERSE (xjj): If Find(x) = Find(j), or M(Find(x),Find(3;)) = 1, delete {x,y) from 
/?i(Find(3;)) and from oMf(FlND(x)). Otherwise, do the following. Let M(Find(x),Find(3;)) = 1 
and arcs = arcs + \. If arcs > A, let k{z) = k{u) + 1, let in{z) = {}, let B = [], unmark any canonical 
vertices marked as being in a new component, make all parents null except that of z, reset M to 
zero, and go to Step 3. If Find(x) = z, mark z as being in a new component. If Find(x) is marked, 
follow parent pointers from Find(j), marking every canonical vertex reached (including Find(j)) 
as being in a new component, until marking u or reaching a marked vertex. If ;7(Find(x)) = NULL, 
let p(Find(x)) = FiND(y) and do EBvisit(Find(x)). 

If the search ends before traversing at least A arcs, test whether k{z) = k{u). If so, go to Step 4; 
if not, let k{z) = k{u) and in{z) = {}■ 

Step 3 (search forward): Do EFvisit(z), where mutually recursive procedures EFviSiT and EF- 
TRAVERSE are defined as follows: 

EFvisit(?): For {x,y) 'mout{t) do EFtraverse(x,3;). LetF = [t]&.F. 

EFtraverse(x,j): If Find(x) = Find(j), delete (x,j) from oMf(FiND(x)). If FiND(y) = u 
or Find(j) is on B, follow parent pointers from Find(j), marking each canonical vertex reached 
(including FlND();)), until mai^king u or reaching a marked vertex. If Find(3') is mai^ked, follow 
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parent pointers from Find(x), marking each canonical vertex reached (including Find(x)), un- 
til marking z or reaching a marked vertex. If ^(Find(3;)) < k{z), let /7(Find(3;)) = Find(;c), let 
yt(FlND(3;)) = k{z), let m(FlND(3;)) = {}, and do EF VISIT (Find (j)). If it(FlND(3;)) = k{z), add 
(x,y) to in(FlND{y)). 

Step 4 (form component): Let L = B&F. If z is marked, combine the old components containing 
the mai^ked vertices into a single new component with canonical vertex z by uniting the incoming 
and outgoing arc sets of the marked vertices, and uniting the vertex sets of the components using 
Unite. Delete from L all marked vertices. Unmark all marked vertices. 

Step 5 (re-index): While L is non-empty, let index = index — 1, delete the last vertex x on L, and let 

i{x) = index. 

Step 6 (add arc): If Find(v) / Find(w), add (v,w) to out{u) and, if k{u) = k{z), to in{z). 

In the proofs to follow we denote levels and indices just before and just after the insertion of an 
arc (v, w) by unprimed and primed values, respectively. 

Theorem 13 The extended sparse algorithm is correct. That is, it correctly maintains the strong 
components, all the data structures, and the following invariant on the levels and indices: if {x,y) 
is an arc, either Find{x) =Find(j) or A:(FlND (a;)), /(FIND (x) < A:(Find(j)),/(Find(j)). 

Proof. The proof is by induction on the number of arc insertions. Initially all the data structures 
are conect. It is straightforward to verify that the algorithm correctly maintains them, assuming 
that it coiTcctly maintains the strong components and the desired invariant on levels and indices. 
Suppose the strong components are correct and the invariant holds before the insertion of an arc 
(v,w). If this insertion does not create a new component, then the algorithm does the same thing as 
the unextended algorithm, except that it operates on components instead of vertices. Thus after the 
insertion the components are correct and the invariant holds. 

Suppose on the other hand that the insertion of {v,w) creates a new component. Until a vertex 
is marked, the algorithm does the same thing as the unextended algorithm, except that it operates 
on components instead of vertices. Thus it will mark at least one vertex. We consider three cases: 
k'{z) = k{z); k'{z) = k{u) > k{zy, k'{z) = k{u) + 1 > k{z). 

If k'{z) = k{z), then k{z) = k{u), the insertion changes no levels, and the backward search fin- 
ishes without traversing at least A arcs. An old canonical vertex x is in the new component if and 
only it is on a simple path from z to u. All components along such a path must have level k{u). An 
induction shows that the backward search marks a canonical vertex if and only if it is on such a path. 
After Step 2, B contains the canonical vertices on level k{u) from which there is a path to u avoiding 
the old component containing z. The algorithm skips the forward search. It correctly forms the new 
component in Step 4 and deletes from L = B all old canonical vertices in the new component except 
z. This includes u. The canonical vertices remaining in L at the end of Step 4 ai^e not reachable 
from z. It follows that Step 5 restores the invariant that if {x,y) is an arc, Find(x) = FlND(y) or 
yt(FlND(x)),/(FlND(x)) < A:(FlND(3;)),/(FlND(y)). 

A similar argument applies if k'{z) = k{u) + 1 > k{z). In this case B = [] in Step 3. At the end of 
Step 3, F contains all old canonical vertices reachable from z by a path through components whose 
old levels are at most k{u). This includes u. An old canonical vertex x is in the new component if 
and only it is on a simple path from z to u. All components along such a path must have old level 
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at most k{u). An induction shows that the forward search maifc a canonical vertex if and only if 
it is on such a path. It follows that the algorithm coiTectly forms the new component in Step 4 and 
restores the invariant on levels and indices in Step 5. 

The remaining case, k'{z) = k{u) > k{z), is the most interesting: the backward search runs out 
of arcs to traverse, and there is a forward search. After Step 2, B contains all old canonical vertices 
from which u is reachable by a path through components of old level k{u). After Step 3, F contains 
all canonical vertices reachable from z by a path through components with old levels less than k{u). 
Thus B and F are disjoint. An old canonical vertex x is in the new component if and only if it is 
on a simple path from z to u. Such a path passes through components in non-increasing order by 
old level. These components have canonical vertices in F or B, with those in F first. An induction 
shows that Steps 2 and 3 mark a canonical vertex if and only if it is on such a path. It follows that 
the algorithm correctly forms the new component in Step 4 and restores the invariant on levels and 
indices in Step 5. □ 

Lemma[2]remains true for the extended algorithm. To bound the running time, we need to prove 
Lemma[3]for the extension. This requires some definitions. We call an arc {x,y) live if x and y are in 
different strong components and dead otherwise. A newly inserted aix that forms a new component 
is dead immediately. The level of a live wc {x,y) is k{PmT>{x)). The level of a dead arc is its 
highest level when it was live; an arc that was never live has no level. We identify each connected 
component with its vertex set; an arc insertion either does not change the components or combines 
two or more components into one. A component is live if it is a component of the cuiTcnt graph and 
dead otherwise. The level of a live component is the level of its canonical vertex; the level of a dead 
component is its highest level when it was live. A vertex and a component are related if there is a 
path that contains the vertex and a vertex in the component. The number of components, live and 
dead, is at most 2n — \. 

Lemma 14 In the extended sparse algorithm, no vertex level exceeds min 

Proof. We claim that for any level k> \ and any level j < k, any canonical vertex of level k 
is related to at least A arcs of level j and at least \/A components of level j. We prove the claim 
by induction on the number of arc insertions. The claim holds vacuously before the first insertion. 
Suppose it holds before the insertion of an arc (v,w). Let u = Find(v) and z = Find(w) before 
the insertion. A vertex is reachable from z after the insertion if and only if it is reachable from z 
before the insertion. The insertion increases the level only of z and possibly of some vertices and 
components reachable from z. It follows that the claim holds after the insertion for any canonical 
vertex not reachable from z- 

Consider a vertex y that is reachable from z and is canonical after the insertion. Since level order 
is topological, k'{y) > k'{z)- For j such that k'{z) < J < k'{y), y is related to at least A arcs of level j 
and Va components of level j before the insertion. None of these arcs or components changes level 
as a result of the insertion, so the claim holds after the insertion for y and level j. Since any arc or 
component of level less than k'{z) that is related to z is also related to y, the claim holds for y after 
the insertion if it holds for z- 

After the insertion, z is reachable from u. Also, k{u) < k'{z) < k{u) + 1. The claim holds for 
u before the insertion. Let {x,y) be an arc of level less than k{u) that is related to u before the 
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insertion. If x is reachable from z, {x,y) will be dead after the insertion and hence its level will not 
change. Neither does its level change if x is not reachable from z- Arc {x,y) is related to z after the 
insertion. Consider a component of level less than k{u) that is related to u before the insertion. If 
the component is reachable from z, it is dead after the insertion of (v, w) and hence does not change 
level; if it is not reachable from z, it also does not change level. After the insertion, the component 
is related to z- It follows that the claim holds for z and any level j <k{u). 

One case remains: j = k{u) < k'{z) = k{u) + 1. For the level of z to increase to k{u) + 1, the 
backward seaixh must traverse at least A aixs of level k{u) before the insertion, each of which is 
related to z and on level k{u) after the insertion. The ends of these arcs are in at least a/A components 
of level k{u), each of which is related to z and on level k{u) after the insertion. Thus the claim holds 
for z and level k{u) after the insertion. This completes the proof of the claim. 

The claim implies that for every level other than the maximum, there are at least A different 
aixs and a/A different components. Since there are only m arcs and at most In — \ components, the 
maximum level is at most min | m/ A , 2?i/ \/A | + 1 . The lemma follows. □ 

Theorem 15 The extended sparse algorithm takes 0(min |m^/^, } ni) time for m arc insertions. 

Proof. The proof is like the proof of Theorem |4l using Lemma [TH □ 

The space required by the extended algorithm is 0{n^), since the bit matrix M requires 0{n^) 
space (or less if bits are packed into words). If we store M in a hash table, the space becomes 0{m) 
but the algorithm becomes randomized. By using a three-level data structure [[271 to store M we can 
reduce the space to 0{n'^^^ +m) without using randomization. We obtain a simpler algorithm with 
a time bound of 0{m^l^) by eliminating the deletion of multiple aixs, thus avoiding the need for M, 
and letting A = m^/^. If we run this simpler algorithm until m > n^/^ , then start over with all vertices 
on level one and indexed in topological order and run the more-complicated algorithm with M stored 
in a three-level data structure, we obtain a deterministic algorithm running in C?(min {m'/^, n^/^ } m) 
time and 0{m) space. 



5.2 An Extension for Dense Graphs 

Our extension of the dense algorithm does two searches per arc addition, the first to find cycles, 
the second to update levels, bounds, and counts. The levels, bounds, counts, and arc heaps are of 
components, not vertices. We store these values with the canonical vertices of the components. 
Initially each vertex is in its own component, all levels are one, all bounds and counts are zero, and 
all heaps are empty. The algorithm deletes aics with both ends in the same component, as well as 
the second and subsequent arcs between the same pair of components. As in the sparse extension, 
to do the latter it uses a bit matrix M indexed by vertex, initially identically zero. 

To insert an arc (v,w), let u = Find(;c) and z = FlND(y). If k{u) < k{z), add (v,w) to out{u) 
with priority k{z). If k{u) > k{z) and m 7^ z, do Steps 1-4 below. (If u = z do nothing.) 
Step 1 (search for cycles): Let S = {{v,w)}. Mark u. Repeat the following step until S is empty: 

Cycle Traversal: Delete some arc {x,y) from S and add it to A. If Find(3') is marked, follow 
parent pointers from Find(;c), marking each canonical vertex reached, until reaching a previously 
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marked vertex. If A:(Find(3;)) < k(u), let ^(Find(3;)) = k{u), let p(FlND(j)) = Find(;c), and delete 
from om?(Find(3')) all aixs with priority at most k{u) and add them to S. 

Step 2 (form component): If z is marked, unite the components containing the marked canonical 
vertices into a single new component whose canonical vertex is z- Form the new arc heap of z by 
melding the heaps of the marked vertices, including z- Unmark all marked vertices. 
Step 3 (update levels, bounds, and counts): Repeat the following step until A is empty: 

Update Traversal: Delete some arc {x,y) from A. If Find(;c) / FlND(y) andM(FlND(;c),FlND(j)) = 
0, proceed as follows. Let M(Find(a;),Find(j)) = 1. If A:(Find(.x;)) > A:(Find(j)), increase 
A:(Find(j)) toA:(FlND(x)) + l; otherwise, let/= [lg(^(FlND(3;)) - ^(Find(;c)))J , add one to c,-(Find(j)), 
and if c,- (Find (j) ) = 3 * 2'+ ^ set c,- (Find (j) ) = 0, set k (Find {y)) = max { )k (Find (j) ) , Z?,- (Find (3;) ) + 3 * 2' } , 
and set ft,(FlND(y)) = ^(Find(j)) — 2'+^ Delete from oMf(FlND(y)) each arc with priority at most 
A:(FlND(y)) and add it to A. Add {x,y) to out{FmD{x)) with priority A:(FlND(y)). 
Step 4 (reset M): Reset each 1 in M to 0. 

Theorem 16 The extended dense algorithm correctly maintains strong components and the in- 
equality k{v) < size{v) for every vertex v. 

Proof. The proof is by induction on the number of arc insertions. The theorem holds initially. 
Suppose it holds before the insertion of an arc {v,w). Let u = Find(v) and z = Find(w) before 
the insertion. If m = z or k{u) < k{z), the theorem holds after the insertion. Suppose u z and 
k{u) > k{z), so that Steps 1-4 are executed. Step 1 visits all vertices of level less than k{u) reachable 
from z and increases their level to k{u). Since z is reachable from u after the insertion of (v,w), all 
such vertices have size at least k{u) after the insertion. Thus such increases in level maintain the 
inequality between levels and sizes. If the insertion of (v,w) creates a new component, the vertices 
in the component are exactly those on paths from z to u, all of which must have level less than 
k{u) before the insertion. Thus Step 1 will visit all such vertices other than u and mark all of them, 
including u, and Step 2 will correctly form the new component. Step 3 updates levels, bounds, and 
counts exactly as in the unextended algorithm except that it operates on components, not vertices, 
and it traverses no arcs with both ends in the same component and at most one arc between any 
pair of components. The proof of Lemma |7] extends to show that Step 3 maintains the inequality 
between levels and sizes. Thus the theorem holds after the insertion. □ 

Theorem 17 The extended dense algorithm runs in 0{n^\ogn) total time. 

Proof. The proof of Lemma|9]extends to show that the extended dense algorithm does O(n^Iogn) 
arc traversals, from which the theorem follows. □ 

The space required by the extended dense algorithm is 0{n^), or 0{n\ogn-\-m) if the heaps and 
the matrix M are stored in hash tables. 

6 Concluding Remarks 

We have presented two algorithms for incremental cycle detection and related problems, one for 
sparse graphs and one for dense graphs. Their total running times are C?(min|m^/^,«^/^}m) 
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and 0{n^logn), respectively. The sparse algorithm is faster for graphs whose density m/n is 
o(?i^/^log?i); the dense algorithm is faster for graphs of density (o(«'/^ logn). The 0{n^^^m) bound 
of the sparse algorithm is best only for graphs with density in the sliver from (o(?i^/^) to o(«^/^log«). 
The HKMST paper gives a lower bound of for algorithms that do vertex updates only 

within the so-called "affected region," the set of vertices that are definitely out of order when a new 
arc is added. Unlike previous algorithms, our algorithms do not do updates completely within the 
affected region, yet they do not beat the HKMST lower bound, and we have no reason to believe 
it can be beaten. On the other hand, for graphs of intermediate density our bounds are far from 
0{nm^/^), and perhaps improvements coming closer to this bound are possible. 

Another interesting research direction is to investigate whether batch arc additions can be han- 
dled faster than single arc additions (other than by reverting to a static algorithm if the batch is large 
enough). See L3u20il. One may also ask whether arc deletions, instead of or in addition to insertions, 
can be handled. Our algorithms remain correct if arcs can be deleted as well as inserted, but the time 
bounds are no longer valid, and we have no interesting bounds. Maintaining strong components as 
arcs are deleted, or as arcs are inserted and deleted, is an even more challenging problem. See |[22l 
and the references contained therein. 
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